4 <110> APPLICANT: BASF AKTIENGESELLSCHAFT et al.

6 <120> TITLE OF INVENTION: METHODS FOR THE PREPARATION OF A FINE 

7 CHEMICAL BY FERMENTATION 
10 <130> FILE REFERENCE: BGI-160PC2 

C--> 12 <140> CURRENT APPLICATION NUMBER: US/10/579,648 
C--> 12 <141> CURRENT FILING DATE: 2006-05-18 

12 <150> PRIOR APPLICATION NUMBER: PCT/IB2003/006435 

13 <151> PRIOR FILING DATE: 2003-12-18 
15 <160> NUMBER OF SEQ ID NOS: 15

17 <170> SOFTWARE: FastSEQ for Windows Version 4.0 

19 <210> SEQ ID NO: 1 

20 <211> LENGTH: 1660 

21 <212> TYPE: DNA 

22 <213> ORGANISM: Corynebacterium glutamicum 

24 <220> FEATURE: 

25 <221> NAME/KEY: CDS 

26 <222> LOCATION: (301)...(1563)
28 <400> SEQUENCE: 1

29 tcggcatcct ctggggtagc gtcaacgcaa tcctcggaac cgtcatcgca gaaaacttcg 60 

30 cacctgaggt ccgctacacc ggcgctaccc tgggttacca agtcggagca gcactcttcg 120 

31 gcggtaccgc acccattatc gcagcatggc tgttcgaaat ctccggcgga caatggtggc 180 

32 caatcgccgt ctacgtcgct gcatgttgcc ttctctctgt gatcgcctcg ttcttcatcc 240 

33 aacgcgtcgc gcaccaagag aactaaaatc taagtaaaac ccctccgaaa ggaaccaccc 300 

34 atg gtg aaa cgt caa ctg ccc aac ccc gca gaa cta ctc gaa ctc atg 348

35 Met Val Lys Arg Gln Leu Pro Asn Pro Ala Glu Leu Leu Glu Leu Met

36 1 5 10 15

38 aag ttc aaa aag cca gag ctc aac ggc aag aaa cga cgc cta gac tcc 396

39 Lys Phe Lys Lys Pro Glu Leu Asn Gly Lys Lys Arg Arg Leu Asp Ser

40 20 25 30

42 gcg ctc acc atc tac gac ctg cgt aaa att gct aaa cga cgc acc cca 444

43 Ala Leu Thr Ile Tyr Asp Leu Arg Lys Ile Ala Lys Arg Arg Thr Pro

44 35 40 45

46 gct gcc gcg ttc gac tac acc gac ggc gca gcc gag gcc gaa ctc tca 492

47 Ala Ala Ala Phe Asp Tyr Thr Asp Gly Ala Ala Glu Ala Glu Leu Ser

48 50 55 60

50 atc aca cgc gca cgt gaa gca ttc gaa aac atc gaa ttc cac cca gac 540

51 Ile Thr Arg Ala Arg Glu Ala Phe Glu Asn Ile Glu Phe His Pro Asp

52 65 70 75 80

54 atc ctc aag cct gca gaa cac gta gac acc acc acc caa atc ctg ggc 588

55 Ile Leu Lys Pro Ala Glu His Val Asp Thr Thr Thr Gln Ile Leu Gly

56 85 90 95

58 gga acc tcc tcc atg cca ttc ggc atc gca cca acc ggc ttc acc cgc 636

59 Gly Thr Ser Ser Met Pro Phe Gly Ile Ala Pro Thr Gly Phe Thr Arg

60 100 105 110

62 ctc atg cag acc gaa ggt gaa atc gca ggt gcc gga gct gca ggc gct 684

63 Leu Met Gln Thr Glu Gly Glu Ile Ala Gly Ala Gly Ala Ala Gly Ala

64 115 120 125

66 gca gga att cct ttc acc ctg tcc acc ctg ggc act acc tcc atc gaa 732

67 Ala Gly Ile Pro Phe Thr Leu Ser Thr Leu Gly Thr Thr Ser Ile Glu

68 130 135 140

70 gac gtc aag gcc acc aac ccc aac ggc cga aac tgg ttc cag ctc tac 780

71 Asp Val Lys Ala Thr Asn Pro Asn Gly Arg Asn Trp Phe Gln Leu Tyr

72 145 150 155 160

74 gtc atg cgc gac cgc gaa atc tcc tac ggc ctc gtc gaa cgc gca gcc 828

75 Val Met Arg Asp Arg Glu Ile Ser Tyr Gly Leu Val Glu Arg Ala Ala

76 165 170 175

78 aaa gca gga ttc gac acc ctg atg ttc acc gtg gat acc ccc atc gcc 876

79 Lys Ala Gly Phe Asp Thr Leu Met Phe Thr Val Asp Thr Pro Ile Ala

80 180 185 190

82 ggc tac cgc atc cgc gat tcc cgc aac gga ttc tcc atc ccg cca cag 924

83 Gly Tyr Arg Ile Arg Asp Ser Arg Asn Gly Phe Ser Ile Pro Pro Gln

84 195 200 205

86 ctg acc cca tcc acc gtg ctc aat gca atc cca cgc cca tgg tgg tgg 972

87 Leu Thr Pro Ser Thr Val Leu Asn Ala Ile Pro Arg Pro Trp Trp Trp

88 210 215 220

90 atc gac ttc ctg acc acc cca acc ctt gag ttc gca tcc ctt tcc tcg 1020

91 Ile Asp Phe Leu Thr Thr Pro Thr Leu Glu Phe Ala Ser Leu Ser Ser

92 225 230 235 240

94 acc ggc gga acc gtg ggc gac ctc ctc aac tcc gcg atg gat ccc acc 1068

95 Thr Gly Gly Thr Val Gly Asp Leu Leu Asn Ser Ala Met Asp Pro Thr

96 245 250 255

98 att tct tac gaa gac ctc aag gtc atc cgt gaa atg tgg cca ggc aag 1116

99 Ile Ser Tyr Glu Asp Leu Lys Val Ile Arg Glu Met Trp Pro Gly Lys

100 260 265 270

102 ctc gta gtc aag ggt gtc cag aac gtt gaa gac tcc gtc aaa ctc ctc 1164

103 Leu Val Val Lys Gly Val Gln Asn Val Glu Asp Ser Val Lys Leu Leu

104 275 280 285

106 gac caa ggc gtc gac ggc ctc atc ctc tcc aac cac ggt ggc cgt caa 1212

107 Asp Gln Gly Val Asp Gly Leu Ile Leu Ser Asn His Gly Gly Arg Gln

108 290 295 300

110 ctc gac cgc gca cca gtc cca ttc cac ctc ctg cca cag gta cgc aag 1260

111 Leu Asp Arg Ala Pro Val Pro Phe His Leu Leu Pro Gln Val Arg Lys

112 305 310 315 320

114 gaa gtc gga tct gaa cca acc atc atg atc gac acc ggc atc atg aac 1308

115 Glu Val Gly Ser Glu Pro Thr Ile Met Ile Asp Thr Gly Ile Met Asn

116 325 330 335

118 ggc gcc gac atc gtc gca gcc gta gcc atg ggc gct gac ttc acc ctc 1356

119 Gly Ala Asp Ile Val Ala Ala Val Ala Met Gly Ala Asp Phe Thr Leu

120 340 345 350

122 atc ggt cgt gcc tac ctc tac gga ctc atg gcc gga ggc cgc gaa ggc 1404

123 Ile Gly Arg Ala Tyr Leu Tyr Gly Leu Met Ala Gly Gly Arg Glu Gly

124 355 360 365
126 gtc gac cgc acc atc gcc att ctc cgc agc gag atc acc cgc acc atg 1452

127 Val Asp Arg Thr Ile Ala Ile Leu Arg Ser Glu Ile Thr Arg Thr Met

128 370 375 380

130 gct ctc ctc ggt gtt tcc tcc ctc gaa gaa ctc gag cca cgc cac gtc 1500

131 Ala Leu Leu Gly Val Ser Ser Leu Glu Glu Leu Glu Pro Arg His Val

132 385 390 395 400

134 acc cag ctg gcc aag atg gtt cca gtt tct gac gca act cgt tct gca 1548

135 Thr Gln Leu Ala Lys Met Val Pro Val Ser Asp Ala Thr Arg Ser Ala

136 405 410 415

138 gcg gcg gag att taa aagtttctct ccttagctat taaaaggtgc ccatccgttt 1603

139 Ala Ala Glu Ile

140 420

142 ggatgggcac cttctcgttt cttgcaatcg gcatattcag tcaaaaaatg ttgaaat 1660

144 <210> SEQ ID NO: 2

145 <211> LENGTH: 420

146 <212> TYPE: PRT

147 <213> ORGANISM: Corynebacterium glutamicum

149 <400> SEQUENCE: 2

150 Met Val Lys Arg Gln Leu Pro Asn Pro Ala Glu Leu Leu Glu Leu Met

151 1 5 10 15

152 Lys Phe Lys Lys Pro Glu Leu Asn Gly Lys Lys Arg Arg Leu Asp Ser

153 20 25 30

154 Ala Leu Thr Ile Tyr Asp Leu Arg Lys Ile Ala Lys Arg Arg Thr Pro

155 35 40 45

156 Ala Ala Ala Phe Asp Tyr Thr Asp Gly Ala Ala Glu Ala Glu Leu Ser

157 50 55 60

158 Ile Thr Arg Ala Arg Glu Ala Phe Glu Asn Ile Glu Phe His Pro Asp

159 65 70 75 80

160 Ile Leu Lys Pro Ala Glu His Val Asp Thr Thr Thr Gln Ile Leu Gly

161 85 90 95

162 Gly Thr Ser Ser Met Pro Phe Gly Ile Ala Pro Thr Gly Phe Thr Arg

163 100 105 110

164 Leu Met Gln Thr Glu Gly Glu Ile Ala Gly Ala Gly Ala Ala Gly Ala

165 115 120 125

166 Ala Gly Ile Pro Phe Thr Leu Ser Thr Leu Gly Thr Thr Ser Ile Glu

167 130 135 140

168 Asp Val Lys Ala Thr Asn Pro Asn Gly Arg Asn Trp Phe Gln Leu Tyr

169 145 150 155 160

170 Val Met Arg Asp Arg Glu Ile Ser Tyr Gly Leu Val Glu Arg Ala Ala

171 165 170 175

172 Lys Ala Gly Phe Asp Thr Leu Met Phe Thr Val Asp Thr Pro Ile Ala

173 180 185 190

174 Gly Tyr Arg Ile Arg Asp Ser Arg Asn Gly Phe Ser Ile Pro Pro Gln

175 195 200 205

176 Leu Thr Pro Ser Thr Val Leu Asn Ala Ile Pro Arg Pro Trp Trp Trp

177 210 215 220

178 Ile Asp Phe Leu Thr Thr Pro Thr Leu Glu Phe Ala Ser Leu Ser Ser

179 225 230 235 240

180 Thr Gly Gly Thr Val Gly Asp Leu Leu Asn Ser Ala Met Asp Pro Thr

181 245 250 255

182 Ile Ser Tyr Glu Asp Leu Lys Val Ile Arg Glu Met Trp Pro Gly Lys

183 260 265 270

184 Leu Val Val Lys Gly Val Gln Asn Val Glu Asp Ser Val Lys Leu Leu

185 275 280 285

186 Asp Gln Gly Val Asp Gly Leu Ile Leu Ser Asn His Gly Gly Arg Gln

187 290 295 300

188 Leu Asp Arg Ala Pro Val Pro Phe His Leu Leu Pro Gln Val Arg Lys

189 305 310 315 320

190 Glu Val Gly Ser Glu Pro Thr Ile Met Ile Asp Thr Gly Ile Met Asn

191 325 330 335

192 Gly Ala Asp Ile Val Ala Ala Val Ala Met Gly Ala Asp Phe Thr Leu

193 340 345 350

194 Ile Gly Arg Ala Tyr Leu Tyr Gly Leu Met Ala Gly Gly Arg Glu Gly

195 355 360 365

196 Val Asp Arg Thr Ile Ala Ile Leu Arg Ser Glu Ile Thr Arg Thr Met

197 370 375 380

198 Ala Leu Leu Gly Val Ser Ser Leu Glu Glu Leu Glu Pro Arg His Val

199 385 390 395 400

200 Thr Gln Leu Ala Lys Met Val Pro Val Ser Asp Ala Thr Arg Ser Ala

201 405 410 415

202 Ala Ala Glu Ile

203 420
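
The CDS feature of SEQ ID NO: 1 and the declared length of SEQ ID NO: 2 can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming 1-based inclusive coordinates and a single terminal stop codon (the taa at positions 1561-1563); the function name `cds_protein_length` is illustrative and not part of the listing.

```python
def cds_protein_length(start: int, end: int) -> int:
    """Return the number of encoded residues for a 1-based, inclusive CDS span.

    Assumption: the span ends with exactly one stop codon, which is
    counted in the span but not in the protein.
    """
    n_bases = end - start + 1
    if n_bases % 3 != 0:
        raise ValueError(f"CDS span of {n_bases} bases is not a whole number of codons")
    return n_bases // 3 - 1  # drop the stop codon


# Coordinates taken from the <222> LOCATION line of SEQ ID NO: 1.
residues = cds_protein_length(301, 1563)
print(residues)
```

Under these assumptions the span covers 1263 bases, i.e. 421 codons, which agrees with the <211> LENGTH of 420 given for SEQ ID NO: 2.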

206 <210> SEQ ID NO: 3 

207 <211> LENGTH: 35 

208 <212> TYPE: DNA 

209 <213> ORGANISM: Artificial Sequence 

211 <220> FEATURE: 

212 <223> OTHER INFORMATION: Oligonucleotide 

214 <400> SEQUENCE: 3 

215 gagagagaga cgcgtcccag tggctgagac gcatc 35 

217 <210> SEQ ID NO: 4 

218 <211> LENGTH: 34 

219 <212> TYPE: DNA 

220 <213> ORGANISM: Artificial Sequence 

222 <220> FEATURE: 

223 <223> OTHER INFORMATION: Oligonucleotide 

225 <400> SEQUENCE: 4 

226 ctctctctgt cgacgaattc aatcttacgg cctg 34 

228 <210> SEQ ID NO: 5 

229 <211> LENGTH: 4323 

230 <212> TYPE: DNA 

231 <213> ORGANISM: Corynebacterium glutamicum 

233 <400> SEQUENCE: 5 

234 tcgagaggcc tgacgtcggg cccggtacca cgcgtcatat gactagttcg gacctaggga 60 

235 tatcgtcgac atcgatgctc ttctgcgtta attaacaatt gggatcctct agacccggga 120 
236 tttaaatcgc tagcgggctg ctaaaggaag cggaacacgt agaaagccag tccgcagaaa 180
237 cggtgctgac cccggatgaa tgtcagctac tgggctatct ggacaaggga aaacgcaagc 240
238 gcaaagagaa agcaggtagc ttgcagtggg cttacatggc gatagctaga ctgggcggtt 300
239 ttatggacag caagcgaacc ggaattgcca gctggggcgc cctctggtaa ggttgggaag 360

240 ccctgcaaag taaactggat ggctttcttg ccgccaagga tctgatggcg caggggatca 420 

241 agatctgatc aagagacagg atgaggatcg tttcgcatga ttgaacaaga tggattgcac 480 

242 gcaggttctc cggccgcttg ggtggagagg ctattcggct atgactgggc acaacagaca 540 

243 atcggctgct ctgatgccgc cgtgttccgg ctgtcagcgc aggggcgccc ggttcttttt 600 

244 gtcaagaccg acctgtccgg tgccctgaat gaactgcagg acgaggcagc gcggctatcg 660 

245 tggctggcca cgacgggcgt tccttgcgca gctgtgctcg acgttgtcac tgaagcggga 720 

246 agggactggc tgctattggg cgaagtgccg gggcaggatc tcctgtcatc tcaccttgct 780 

247 cctgccgaga aagtatccat catggctgat gcaatgcggc ggctgcatac gcttgatccg 840 

248 gctacctgcc cattcgacca ccaagcgaaa catcgcatcg agcgagcacg tactcggatg 900 
249 gaagccggtc ttgtcgatca ggatgatctg gacgaagagc atcaggggct cgcgccagcc 960

250 gaactgttcg ccaggctcaa ggcgcgcatg cccgacggcg aggatctcgt cgtgacccat 1020 

251 ggcgatgcct gcttgccgaa tatcatggtg gaaaatggcc gcttttctgg attcatcgac 1080 

252 tgtggccggc tgggtgtggc ggaccgctat caggacatag cgttggctac ccgtgatatt 1140 

253 gctgaagagc ttggcggcga atgggctgac cgcttcctcg tgctttacgg tatcgccgct 1200 

254 cccgattcgc agcgcatcgc cttctatcgc cttcttgacg agttcttctg agcgggactc 1260 

255 tggggttcga aatgaccgac caagcgacgc ccaacctgcc atcacgagat ttcgattcca 1320 

256 ccgccgcctt ctatgaaagg ttgggcttcg gaatcgtttt ccgggacgcc ggctggatga 1380 

257 tcctccagcg cggggatctc atgctggagt tcttcgccca cgctagcggc gcgccggccg 1440 

258 gcccggtgtg aaataccgca cagatgcgta aggagaaaat accgcatcag gcgctcttcc 1500 

259 gcttcctcgc tcactgactc gctgcgctcg gtcgttcggc tgcggcgagc ggtatcagct 1560 

260 cactcaaagg cggtaatacg gttatccaca gaatcagggg ataacgcagg aaagaacatg 1620

261 tgagcaaaag gccagcaaaa ggccaggaac cgtaaaaagg ccgcgttgct ggcgtttttc 1680 

262 cataggctcc gcccccctga cgagcatcac aaaaatcgac gctcaagtca gaggtggcga 1740 

263 aacccgacag gactataaag ataccaggcg tttccccctg gaagctccct cgtgcgctct 1800 

264 cctgttccga ccctgccgct taccggatac ctgtccgcct ttctcccttc gggaagcgtg 1860 

265 gcgctttctc atagctcacg ctgtaggtat ctcagttcgg tgtaggtcgt tcgctccaag 1920 

266 ctgggctgtg tgcacgaacc ccccgttcag cccgaccgct gcgccttatc cggtaactat 1980 

267 cgtcttgagt ccaacccggt aagacacgac ttatcgccac tggcagcagc cactggtaac 2040 

268 aggattagca gagcgaggta tgtaggcggt gctacagagt tcttgaagtg gtggcctaac 2100 

269 tacggctaca ctagaaggac agtatttggt atctgcgctc tgctgaagcc agttaccttc 2160 

270 ggaaaaagag ttggtagctc ttgatccggc aaacaaacca ccgctggtag cggtggtttt 2220 

271 tttgtttgca agcagcagat tacgcgcaga aaaaaaggat ctcaagaaga tcctttgatc 2280 

272 ttttctacgg ggtctgacgc tcagtggaac gaaaactcac gttaagggat tttggtcatg 2340 

273 agattatcaa aaaggatctt cacctagatc cttttaaagg ccggccgcgg ccgccatcgg 2400 

274 cattttcttt tgcgttttta tttgttaact gttaattgtc cttgttcaag gatgctgtct 2460 

275 ttgacaacag atgttttctt gcctttgatg ttcagcagga agctcggcgc aaacgttgat 2520 

276 tgtttgtctg cgtagaatcc tctgtttgtc atatagcttg taatcacgac attgtttcct 2580 

277 ttcgcttgag gtacagcgaa gtgtgagtaa gtaaaggtta catcgttagg atcaagatcc 2640 

278 atttttaaca caaggccagt tttgttcagc ggcttgtatg ggccagttaa agaattagaa 2700 

279 acataaccaa gcatgtaaat atcgttagac gtaatgccgt caatcgtcat ttttgatccg 2760 

280 cgggagtcag tgaacaggta ccatttgccg ttcattttaa agacgttcgc gcgttcaatt 2820 

281 tcatctgtta ctgtgttaga tgcaatcagc ggtttcatca cttttttcag tgtgtaatca 2880 

282 tcgtttagct caatcatacc gagagcgccg tttgctaact cagccgtgcg ttttttatcg 2940 

283 ctttgcagaa gtttttgact ttcttgacgg aagaatgatg tgcttttgcc atagtatgct 3000 

284 ttgttaaata aagattcttc gccttggtag ccatcttcag ttccagtgtt tgcttcaaat 3060 
285 actaagtatt tgtggccttt atcttctacg tagtgaggat ctctcagcgt atggttgtcg 3120

286 cctgagctgt agttgccttc atcgatgaac tgctgtacat tttgatacgt ttttccgtca 3180 

287 ccgtcaaaga ttgatttata atcctctaca ccgttgatgt tcaaagagct gtctgatgct 3240 
Input Set : F:\SEQLIST.txt



ERRORED SEQUENCES 



Not Comply

819 <210> SEQ ID NO: 15

820 <211> LENGTH: 7561

821 <212> TYPE: DNA

822 <213> ORGANISM: Corynebacterium glutamicum

824 <400> SEQUENCE: 15

825 accatttccg ttcatttaaa gacgttcgcg cgtcaatttc atctgtactg tgtagatgca 60

826 tcagcggttt catcactttt ttcagtgtga atcatcgttt agctcaatca taccgagagc 120

827 gccgtttgct aactcaaccg tgcgtttttt atcgctttgc agaagttttt gactttcttg 180

828 acggaagaat gatgtgcttt tgccatagta tgctttgtta aataaagatt cttcgccttg 240

829 gtagccatct tcagttccag tgtttgcttc aaatactaag tatttgtggc ctttatcttc 300

830 tacgtagtga ggatctctca gcgtatggtt gtcgcctgag ctgtagttgc cttcatcgat 360

831 gaactgctgt acattttgat acgtttttcc gtcaccgtca aagattgatt tataatcctc 420

832 tacaccgttg atgttcaaag agctgtctga tgctgatacg ttaacttgtg cagttgtcag 480

833 tgtttgtttg ccgtaatgtt taccggagaa atcagtgtag aataaacgga tttttccgtc 540

834 agatgtaaat gtggctgaac ctgaccattc ttgtgtttgg tcttttagga tagaatcatt 600

835 tgcatcgaat ttgtcgctgt ctttaaagac gcggccagcg tttttccagc tgtcaataga 660

836 agtttcgccg actttttgat agaacatgta aatcgatgtg tcatccgcat ttttaggatc 720

837 tccggctaat gcaaagacga tgtggtagcc gtgatagttt gcgacagtgc cgtcagcgtt 780

838 ttgtaatggc cagctgtccc aaacgtccag gccttttgca gaagagatat ttttaattgt 840

839 ggacgaatca aattcagaaa cttgatattt ttcatttttt tgctgttcag ggatttgcag 900

840 catatcatgg cgtgtaatat gggaaatgcc gtatgtttcc ttatatggct tttggttcgt 960

841 ttctttcgca aacgcttgag ttgcgcctcc tgccagcagt gcggtagtaa aggttaatac 1020

842 tgttgcttgt tttgcaaact ttttgatgtt catcgttcat gtctcctttt ttatgtactg 1080

843 tgttagcggt ctgcttcttc cagccctcct gtttgaagat ggcaagttag ttacgcacaa 1140

844 taaaaaaaga cctaaaatat gtaaggggtg acgccaaagt atacactttg ccctttacac 1200

845 attttaggtc ttgcctgctt tatcagtaac aaacccgcgc gatttacttt tcgacctcat 1260

846 tctattagac tctcgtttgg attgcaactg gtctattttc ctcttttgtt tgatagaaaa 1320

847 tcataaaagg atttgcagac tacgggccta aagaactaaa aaatctatct gtttcttttc 1380

848 attctctgta ttttttatag tttctgttgc atgggcataa agttgccttt ttaatcacaa 1440
849 ttcagaaaat atcataatat ctcatttcac taaataatag tgaacggcag gtatatgtga 1500

850 tgggttaaaa aggatcgatc ctctagcgaa ccccagagtc ccgctcagaa gaactcgtca 1560 

851 agaaggcgat agaaggcgat gcgctgcgaa tcgggagcgg cgataccgta aagcacgagg 1620 

852 aagcggtcag cccattcgcc gccaagctct tcagcaatat cacgggtagc caacgctatg 1680 

853 tcctgatagc ggtccgccac acccagccgg ccacagtcga tgaatccaga aaagcggcca 1740 

854 ttttccacca tgatattcgg caagcaggca tcgccatggg tcacgacgag atcctcgccg 1800 

855 tcgggcatcc gcgccttgag cctggcgaac agttcggctg gcgcgagccc ctgatgctct 1860 

856 tcgtccagat catcctgatc gacaagaccg gcttccatcc gagtacgtgc tcgctcgatg 1920 

857 cgatgtttcg cttggtggtc gaatgggcag gtagccggat caagcgtatg cagccgccgc 1980 

858 attgcatcag ccatgatgga tactttctcg gcaggagcaa ggtgagatga caggagatcc 2040 

859 tgccccggca cttcgcccaa tagcagccag tcccttcccg cttcagtgac aacgtcgagc 2100 

860 acagctgcgc aaggaacgcc cgtcgtggcc agccacgata gccgcgctgc ctcgtcttgg 2160 

861 agttcattca gggcaccgga caggtcggtc ttgacaaaaa gaaccgggcg cccctgcgct 2220 

862 gacagccgga acacggcggc atcagagcag ccgattgtct gttgtgccca gtcatagccg 2280

863 aatagcctct ccacccaagc ggccggagaa cctgcgtgca atccatcttg ttcaatcatg 2340 

864 cgaaacgatc ctcatcctgt ctcttgatca gatcttgatc ccctgcgcca tcagatcctt 2400 

865 ggcggcaaga aagccatcca gtttactttg cagggcttcc caaccttacc agagggcgcc 2460 

866 ccagctggca attccggttc gcttgctgtc cataaaaccg cccagtctag ctatcgccat 2520 

867 gtaagcccac tgcaagctac ctgctttctc tttgcgcttg cgttttccct tgtccagata 2580 

868 gcccagtagc tgacattcat ccggggtcag caccgtttct gcggactggc tttctacgtg 2640 

869 ttccgcttcc tttagcagcc cttgcgccct gagtgcttgc ggcagcgtga agctagccat 2700 

870 tgtccttctg gcagttgctt gcgccgccct cgttgccacc atctggatgc cactgttcgg 2760 

871 atccttctcc gaccgcgtca accgtgcagt gctctacagg atctgtgcat ccgcaaccat 2820 

872 cgtgctgatt gtcccttact acttggtcct caacaccggc gaaatttggg cactgtttat 2880 

873 cactaccgtg attggcttcg gcatcctctg gggtagcgtc aacgcaatcc tcggaaccgt 2940 

874 catcgcagaa aacttcgcac ctgaggtccg ctacaccggc gctaccctgg gttaccaagt 3000 

875 cggagcagca ctcttcggcg gtaccgcacc cattatcgca gcatggctgt tcgaaatctc 3060 

876 cggcggacaa tggtggccaa tcgccgtcta cgtcgctgca tgttgccttc tctctgtgat 3120 

877 cgcctcgttc ttcatccaac gcgtcgcgca ccaagagaac taaaatctaa gtaaaacccc 3180 

878 tccgaaagga accacccatg gtgaaacgtc aactgcccaa ccccgcagaa ctactcgaac 3240 

879 tcatgaagtt caaaaagcca gagctcaacg gcaagaaacg acgcctagac tccgcgctca 3300 

880 ccatctacga cctgcgtaaa attgctaaac gacgcacccc agctgccgcg ttcgactaca 3360 

881 ccgacggcgc agccgaggcc gaactctcaa tcacacgcgc acgtgaagca ttcgaaaaca 3420 

882 tcgaattcca cccagacatc ctcaagcctg cagaacacgt agacaccacc acccaaatcc 3480 

883 tgggcggaac ctcctccatg ccattcggca tcgcaccaac cggcttcacc cgcctcatgc 3540 

884 agaccgaagg tgaaatcgca ggtgccggag ctgcaggcgc tgcaggaatt cctttcaccc 3600 

885 tgtccaccct gggcactacc tccatcgaag acgtcaaggc caccaacccc aacggccgaa 3660 

886 actggttcca gctctacgtc atgcgcgacc gcgaaatctc ctacggcctc gtcgaacgcg 3720 

887 cagccaaagc aggattcgac accctgatgt tcaccgtgga tacccccatc gccggctacc 3780 

888 gcatccgcga ttcccgcaac ggattctcca tcccgccaca gctgacccca tccaccgtgc 3840 

889 tcaatgcaat cccacgccca tggtggtgga tcgacttcct gaccacccca acccttgagt 3900

890 tcgcatccct ttcctcgacc ggcggaaccg tgggcgacct cctcaactcc gcgatggatc 3960 

891 ccaccatttc ttacgaagac ctcaaggtca tccgtgaaat gtggccaggc aagctcgtag 4020 

892 tcaagggtgt ccagaacgtt gaagactccg tcaaactcct cgaccaaggc gtcgacggcc 4080 

893 tcatcctctc caaccacggt ggccgtcaac tcgaccgcgc accagtccca ttccacctcc 4140 

894 tgccacaggt acgcaaggaa gtcggatctg aaccaaccat catgatcgac accggcatca 4200 

895 tgaacggcgc cgacatcgtc gcagccgtag ccatgggcgc tgacttcacc ctcatcggtc 4260 

896 gtgcctacct ctacggactc atggccggag gccgcgaagg cgtcgaccgc accatcgcca 4320 

897 ttctccgcag cgagatcacc cgcaccatgg ctctcctcgg tgtttcctcc ctcgaagaac 4380 
898 tcgagccacg ccacgtcacc cagctggcca agatggttcc agtttctgac gcaactcgtt 4440

899 ctgcagcggc ggagatttaa aagtttctct ccttagctat taaaaggtgc ccatccgttt 4500 

900 ggatgggcac cttctcgttt cttgcaatcg gcatattcag tcaaaaaatg ttgaaatcag 4560 

901 cactttcaat ttgggacatc tactcttagg agaaaagcca caaacctttc ccaccccaca 4620 

902 accgtgtgtt ctgcagtcga cccagtttag aggaaacatg agtgacttca cggaaaatac 4680 

903 ttggactgtc cactacgacg aagatggtga tttcccaaaa ttcttcaact ctctaaagga 4740 

904 acacgagcgt ctagagtcga cctgcaggca tgcaagcttg gcgtaatcat ggtcatagct 4800 

905 gtttcctgtg tgaaattgtt atccgctcac aattccacac aacatacgag ccggaagcat 4860 

906 aaagtgtaaa gcctggggtg cctaatgagt gagctaactc acattaattg cgttgcgctc 4920 

907 actgcccgct ttccagtcgg gaaacctgtc gtgccagctg cattaatgaa tcggccaacg 4980 

908 cgcggggaga ggcggtttgc gtattgggcg ctcttccgct tcctcgctca ctgactcgct 5040 

909 gcgctcggtc gttcggctgc ggcgagcggt atcagctcac tcaaaggcgg taatacggtt 5100 

910 atccacagaa tcaggggata acgcaggaaa gaacatgtga gcaaaaggcc agcaaaaggc 5160 

911 caggaaccgt aaaaaggccg cgttgctggc gtttttccat aggctccgcc cccctgacga 5220 

912 gcatcacaaa aatcgacgct caagtcagag gtggcgaaac ccgacaggac tataaagata 5280 

913 ccaggcgttt ccccctggaa gctccctcgt gcgctctcct gttccgaccc tgccgcttac 5340 

914 cggatacctg tccgcctttc tcccttcggg aagcgtggcg ctttctcata gctcacgctg 5400 

915 taggtatctc agttcggtgt aggtcgttcg ctccaagctg ggctgtgtgc acgaaccccc 5460 

916 cgttcagccc gaccgctgcg ccttatccgg taactatcgt cttgagtcca acccggtaag 5520 

917 acacgactta tcgccactgg cagcagccac tggtaacagg attagcagag cgaggtatgt 5580

918 aggcggtgct acagagttct tgaagtggtg gcctaactac ggctacacta gaagaacagt 5640 

919 atttggtatc tgcgctctgc tgaagccagt taccttcgga aaaagagttg gtagctcttg 5700 

920 atccggcaaa caaaccaccg ctggtagcgg tggttttttt gtttgcaagc agcagattac 5760 

921 gcgcagaaaa aaaggatctc aagaagatcc tttgatcttt tctacggggt ctgacgctca 5820

922 gtggaacgaa aactcacgtt aagggatttt ggtcatgaga ttatcaaaaa ggatcttcac 5880 

923 ctagatcctt ttggggtggg cgaagaactc cagcatgaga tccccgcgct ggaggatcat 5940 

924 ccagccctga tagaaacaga agccactgga gcacctcaaa aacaccatca tacactaaat 6000 
925 cagtaagttg gcagcatcac ccgacgcact ttgcgccgaa taaatacctg tgacggaaga 6060

926 tcacttcgca gaataaataa atcctggtgt ccctgttgat accgggaagc cctgggccaa 6120

927 cttttggcga aaatgagacg ttgatcggca cgtaagaggt tccaactttc accataatga 6180 

928 aataagatca ctaccgggcg tattttttga gttatcgaga ttttcaggag ctgatagaaa 6240 

929 cagaagccac tggagcacct caaaaacacc atcatacact aaatcagtaa gttggcagca 6300

930 tcacccgacg cactttgcgc cgaataaata cctgtgacgg aagatcactt cgcagaataa 6360 

931 ataaatcctg gtgtccctgt tgataccggg aagccctggg ccaacttttg gcgaaaatga 6420 

932 gacgttgatc ggcacgtaag aggttccaac tttcaccata atgaaataag atcactaccg 6480 

933 ggcgtatttt ttgagttatc gagattttca ggagctcttt ggcatcgtct ctcgcctgtc 6540 

934 ccctcagttc agtaatttcc tgcatttgcc tgtttccagt cggtagatat tccacaaaac 6600 

935 agcagggaag cagcgctttt ccgctgcata accctgcttc ggggtcatta tagcgatttt 6660 

936 ttcggtatat ccatcctttt tcgcacgata tacaggattt tgccaaaggg ttcgtgtaga 6720 

937 ctttccttgg tgtatccaac ggcgtcagcc gggcaggata ggtgaagtag gcccacccgc 6780

938 gagcgggtgt tccttcttca ctgtccctta ttcgcacctg gcggtgctca acgggaatcc 6840 

939 tgctctgcga ggctggccgg ctaccgccgg cgtaacagat gagggcaagc ggatggctga 6900 

940 tgaaaccaag ccaaccagga agggcagccc acctatcaag gtgtactgcc ttccagacga 6960 

941 acgaagagcg attgaggaaa aggcggcggc ggccggcatg agcctgtcgg cctacctgct 7020

942 ggccgtcggc cagggctaca aaatcacggg cgtcgtggac tatgagcacg tccgcgaggg 7080 

943 cgtcccggaa aacgattccg aagcccaacc tttcatagaa ggcggcggtg gaatcgaaat 7140 

944 ctcgtgatgg caggttgggc gtcgcttggt cggtcatttc gctcggtacc catcggcatt 7200 

945 ttcttttgcg tttttatttg ttaactgtta attgtccttg ttcaaggatg ctgtctttga 7260 

946 caacagatgt tttcttgcct ttgatgttca gcargaagct cggcgcaaac gttgattgtt 7320 
947 tgtctgcgta gaatcctctg tttgtcatat agcttgtaat cacgacattg tttccttytc 7380

948 gcttgaggta cagcgaagtg tgagtaagta araggttaca tcgttaggat caagatccat 7440 

949 tcttaacaca aggccagttt tgttcagcgg cttgtatggg ccagttaaag aattataaac 7500 

950 ataaccaagc atgtaaatat cgttagacgt aatgccgtca atcgtcatta ttgatccgcg 7560 

951 g ^/ 7561 

E--> 95yB§f^tS0¥cT 
E--> 95| 
VERIFICATION SUMMARY

PATENT APPLICATION: US/10/579,648 



DATE: 05/26/2006 
TIME: 08:25:30 



Input Set : F:\SEQLIST.txt

Output Set: N:\CRF4\05262006\J579648.raw 



L:12 M:270 C: Current Application Number differs. Replaced Current Application No 
L:12 M:271 C: Current Filing Date differs. Replaced Current Filing Date 
L:953 M:254 E: No. of Bases conflict, LENGTH : Input : 2 Counted:7567 SEQ:15 
L:953 M:320 E: (1) Wrong Nucleic Acid Designator, NUMBER OF INVALID KEYS:6 
L:953 M:112 C: (48) String data converted to lower case, 
M:254 Repeated in SeqNo=15 

L:955 M:252 E: No. of Seq. differs, <211> LENGTH : Input : 7561 Found:7567 SEQ:15 
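
The length conflicts reported above (M:254, M:252) come from comparing the declared <211> LENGTH with the number of residues actually found. The Python sketch below shows one way such a count could be reproduced from listing rows. It is an illustration only, not the checker's actual algorithm: it assumes each row has the form "listing line number, lower-case base blocks, running total", and the names `count_bases` and `check_length` are hypothetical.

```python
import re

# Assumed row layout: "<listing line no.> <base blocks ...> <running total>".
# Only purely lower-case alphabetic tokens are counted as sequence,
# so the numeric columns on either side are ignored.
BLOCK = re.compile(r"^[a-z]+$")


def count_bases(rows):
    """Count residues in listing rows, skipping numeric columns."""
    total = 0
    for row in rows:
        for token in row.split():
            if BLOCK.match(token):
                total += len(token)
    return total


def check_length(declared, rows):
    """Return (ok, counted) for a declared <211> LENGTH."""
    counted = count_bases(rows)
    return counted == declared, counted


# Row 215 of this listing (SEQ ID NO: 3, declared LENGTH: 35).
seq3_rows = ["215 gagagagaga cgcgtcccag tggctgagac gcatc 35"]
print(check_length(35, seq3_rows))
```

Feeding every row of SEQ ID NO: 15 to `check_length(7561, rows)` would flag a mismatch if stray characters were counted as residues. Because this sketch counts only lower-case letters, it would not necessarily reproduce the checker's figure of 7567.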